Pilot Project on Speech Recognition using Layered Abduction and Multiple Knowledge Types

Authors

  • John R. Josephson
  • M. Beckman
  • R. Fox
  • A. Krishnamurthy
  • T. Patten
  • L. Feth
  • B. Chandrasekaran
Abstract

Our long-term goal is to test the feasibility of basing a recognition system on a model of human speech processing. Key hypotheses are: (1) Computational models for reasoning from incomplete knowledge provide a useful metaphor for many aspects of human speech understanding, even at levels where highly automatic perceptual processes are at work; (2) A computational model of human cochlear processing must be used as the signal-processing front end; (3) Some representation of articulation should mediate between the acoustics and the phonology in order to accommodate contextual variation of various sorts; (4) The phonological representation must encode the prosodic structure and intonation pattern as well as the phoneme string. Accordingly, our specific short-term goal is to build a small prototype system that uses a layered-abduction architecture, in which there are many stages of processing, corresponding to different levels of knowledge. The common information-processing task at each stage is to form a coherent, composite (multi-part) hypothesis that explains the data presented from the preceding levels. The input signal will be the digitized speech processed by Patterson's Stabilized Auditory Image system. An articulatory representation based on Browman and Goldstein's gestural score will mediate between the auditory representation and the phonology, and Pierrehumbert and Beckman's prosodic tree and tone string will be used for the phonological organization and intonational melody.
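The layered-abduction idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' system: the greedy covering strategy, the `Hypothesis` class, the function names, and all finding, gesture, and phoneme labels below are assumptions made up for illustration. Each layer picks a composite hypothesis that explains the data handed up from the layer below, and the accepted hypotheses become the data for the next layer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    name: str
    explains: frozenset   # data items this hypothesis can account for
    plausibility: float   # hypothetical confidence score in [0, 1]


def abduce(data, candidates):
    """Greedily build a composite hypothesis covering as much of `data` as possible.

    Greedy covering is a simplifying assumption, not the method of the paper.
    """
    unexplained = set(data)
    remaining = list(candidates)
    composite = []
    while unexplained and remaining:
        # Prefer the candidate explaining the most unexplained items,
        # breaking ties by plausibility.
        best = max(
            remaining,
            key=lambda h: (len(h.explains & unexplained), h.plausibility),
        )
        if not best.explains & unexplained:
            break  # nothing left can explain the residue
        composite.append(best)
        unexplained -= best.explains
        remaining.remove(best)
    return composite, unexplained


def layered_abduction(findings, layers):
    """Run abduction layer by layer; each layer explains the output of the previous one."""
    data = set(findings)
    trace = []
    for candidates in layers:
        composite, unexplained = abduce(data, candidates)
        trace.append(([h.name for h in composite], unexplained))
        data = {h.name for h in composite}
    return trace


# Toy layers (all labels hypothetical): auditory findings -> gestures -> phonemes.
gesture_layer = [
    Hypothesis("lip_closure", frozenset({"burst"}), 0.9),
    Hypothesis("tongue_raise", frozenset({"formant_rise"}), 0.8),
    Hypothesis("glottal_adduction", frozenset({"voicing"}), 0.7),
]
phoneme_layer = [
    Hypothesis("/b/", frozenset({"lip_closure", "glottal_adduction"}), 0.8),
    Hypothesis("/i/", frozenset({"tongue_raise", "glottal_adduction"}), 0.7),
    Hypothesis("/p/", frozenset({"lip_closure"}), 0.6),
]

trace = layered_abduction(
    {"burst", "formant_rise", "voicing"}, [gesture_layer, phoneme_layer]
)
for level, (accepted, residue) in enumerate(trace, start=1):
    print(level, accepted, residue)
```

In this toy run the gesture layer accounts for all three auditory findings, and the phoneme layer then explains the three gestures with two phoneme hypotheses. A fuller system would also need to handle incompatible hypotheses and feedback between layers, which this sketch leaves out.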


Similar resources

P65: Speech Recognition Based on Brain Signals by the Quantum Support Vector Machine for Inflammatory Patient ALS

People communicate with each other by exchanging verbal and visual expressions. However, paralyzed patients with various neurological diseases such as amyotrophic lateral sclerosis and cerebral ischemia have difficulties in daily communications because they cannot control their body voluntarily. In this context, brain-computer interface (BCI) has been studied as a tool of communication for thes...


Layered Abduction For Speech Recognition From Articulation: Artrec

monious covering as a method for natural language interfaces to expert systems. Subdivision of the audible frequency range into critical bands. In Journal of Acoustical Society of America, volume 33.


Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...


Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...


The Effect of Colligational Corpus-based Instruction on Enhancing the Pragmalinguistic Knowledge of Request Speech Act among Iranian Intermediate EFL Learners

This study investigated the effectiveness of colligational corpus-based instruction on enhancing the pragmalinguistic knowledge of speech act of request among Iranian intermediate EFL learners. The objective of the study was to find out whether or not providing students with corpora through using colligational instruction had any significant effects on enhancing their pragmalinguistic knowledge...




Publication date: 1989